Image Captioning with Synergy-Gated Attention and Recurrent Fusion LSTM

You Yang; Lizhi Chen; Longyue Pan; Juntao Hu

연구문헌

영문 논문지

홈 > 연구문헌 > 영문 논문지 > TIIS (한국인터넷정보학회)

TIIS (한국인터넷정보학회)

Current Result Document : 7 / 108 이전건 다음건

한글제목(Korean Title)	Image Captioning with Synergy-Gated Attention and Recurrent Fusion LSTM
영문제목(English Title)	Image Captioning with Synergy-Gated Attention and Recurrent Fusion LSTM
저자(Author)	You Yang Lizhi Chen Longyue Pan Juntao Hu
원문수록처(Citation)	VOL 16 NO. 10 PP. 3390 ~ 3405 (2022. 10)
한글내용 (Korean Abstract)
영문내용 (English Abstract)	Long Short-Term Memory (LSTM) combined with attention mechanism is extensively used to generate semantic sentences of images in image captioning models. However, features of salient regions and spatial information are not utilized sufficiently in most related works. Meanwhile, the LSTM also suffers from the problem of underutilized information in a single time step. In the paper, two innovative approaches are proposed to solve these problems. First, the Synergy-Gated Attention (SGA) method is proposed, which can process the spatial features and the salient region features of given images simultaneously. SGA establishes a gated mechanism through the global features to guide the interaction of information between these two features. Then, the Recurrent Fusion LSTM (RF-LSTM) mechanism is proposed, which can predict the next hidden vectors in one time step and improve linguistic coherence by fusing future information. Experimental results on the benchmark dataset of MSCOCO show that compared with the state-of-the-art methods, the proposed method can improve the performance of image captioning model, and achieve competitive performance on multiple evaluation indicators.
키워드(Keyword)	Image captioning Synergy-Gated Attention Recurrent Fusion LSTM Deep learning
파일첨부	PDF 다운로드